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Abstract 

Expression of the light-harvesting complex protein genes {Lhc) is under the control of a circadian clock. To dissect 
the molecular regulatory components of the circadian clock a promoter deletion analysis of four tomato Lhc genes 
was performed in transgenic tobacco plants. The important 5'-upstrcam promoter regions are present at different 
positions relative to the transcription Start site of Lhc bl*i, bl*2, Lhc a3 and Lhc a4. A short sequence of 47 
nucleotides is necessary for conferring circadian Lhc mRNA oscillations. Sequence alignment of the specified 
promoter regions revealed a novel motif 'CAANNNNATC*. This motif is conserved in 5'-upstrcam regions of 
clock controlled Lhc genes and overlaps with a sequence relevant in phytochromc mediated gene expression. 

Abbreviations: Lhc alb, genes encoding lighfrharveating complex proteins (formerly cab, chlorophyll o/i-binding 
proteins); DD. continuous darkness; LD, light/dark conditions 



Circadian rhythms have been described in nearly every 
eukaryotic organism as well as in some prokaryotcs 
[19]. Despite the universal appearance and accu- 
mulated knowledge about the phenomenon of such 
rhythms, the molecular mechanism of the circadian 
Clock is still not understood. Evidence is accumulating 
that indicates that the transmission of the biological 
clock occurs by feedback and autoregulation [2$, 4], 
To shed some light on the basic machinery we started 
to investigate possible components necessary for the 
circadian mRNA accumulation of the light-harvesting 
complex proteins {Lhc) of plants. 

The light-harvesting complexes of plants arc orga- 
nized as protein-pigment units in the thylakotd mem- 
branes of chioroplasts which enhance the probability 
for light quantum absorption and focused channelling 
of the energy to the photosynthelic reaction centres [1, 
1 6]. The proteins and pigments of these complexes and 
their respective genes have been intensively investi- 
gated in plants. Besides several important findings it 



turned out that the synthesis of the proteins and the 
accumulation of respective mRNAs are under the con- 
trol of a circadian clock [24, 27]. Interestingly, all 19 
members of the tomato Lhc gene family exhibit this 
characteristic expression partem [18]. Based on this 
similarity a common control mechanism is expected 
to function at the level of transcription of each tomato 
Lhc gene, for example cis- and tains-regulatory ele- 
ments. 

Several distinct cXr-regulatory motifs have been de- 
lected for plant genes [22, 29]- A prominent element 
is the 'ACGT\ the G-box, first described by Giuliano 
et al. [13], which is present in the promoter of plant 
eencs encoding very different proteins [31]. However, 
the fronj-acting factors binding to this motif belong 
to the same bZip type [3, 11, 15], Another sequence 
('CCTTATCAT') has been described and correlated 
with light-responsive expression of several plant genes 
[14] as well as the 'GATA' motif (1-box). The lat- 
ter, described by Castresana et al [8], is present 
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iTfft^ /, Steady-stale toRNA levels of (he tomato Uic bl"! (en* I A) gene in transgenic tobacco. A. Transgenic tobacco pfents carrying 
Uic deleted tomato Lhc gene promoter (A-109I, A-701, A-159, A-43) were grown in lighf/darfc (light: 0fi:0(M&:0O) and continuous dart 
conditions. Leivcs were harvested «i indicated time points and tomato IAc mRNAa were detected with the primer extension technkw. Tlie 
lengA of the extended primer is 70 bp. B. toRNA level* of deletion construct* were quantised (fool per me RNA) and data of rcprescntetive 
tofaavco lines are presented. ^ 
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in the promoter of several light-harvesting complex 
proteins <LHCP) [12, 25], of the small subunit of 
Rubisco f9] and in the CaMV 35S promoter [21]. In 
the case of many Lhc gene promoters this motif is 
two to four times repeated. The nucleotide distances 
between the 'GATA* sequences are highly conserved 
[25). Since the discovery of this motif and its abun- 
dant appearance a role as a regulatory unit had been 



postulated. Variation of the nucleotide sequence of this 
motif indicated that the *GATA' clement modulates the 
transcription positively rather than negatively [12]. A 
correlation Of light- and/or tissue-specific expression 
wilh the presence of this motif was not observed. A 
possible function in mediating circadian rhythmicity 
of Lhc mRNA accumulation Was discussed since a 
short promoter fragment of the Lhc bl*l (cab2) gene 
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tffwv 4, Steady-male mRNA levels of the tomato Lhc aA (cab II) gene, in inrnigenic tobacco. The Lhc *4 promoter regions (-251 to +4 or 
- 1! 9 to +4) wwc cloned into the p01 101.2 reporter vector from Clontech. Transflenie tobacco plants containing the constructs were ffown m 
light/dark (lighf: 06\«0<M8:00) and continuous dp* conditions. Leaves were harvested at indicated time points and the tfueunraidasc mRNA 
levels were detected by northern blot analyaia. The length of the gus mRNA is ca. 1 .9 kb. For abbreviations, we Figure 5. 



of Arabidopsis thaliana containing this motif confers sequence in tomato was suggested 10 be a component 
eifcadian Lhc expression in transgenic plants [2]. In of the signal transduction chain of the cireadian clock 
addition, the sequence *ACTT flanking the 'GATA' [6]. 
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F&ure J. Summmry of the deletion analgia. Cireadon oscillations of tomato Lhc inRNAs m transgenic tobacco arc indicated by the 'white* 
clock. No rhythmic the mRNA accumulations were derecxed lv tobacco lines transfomied with tha tompio Lhc bl* 1 A- 159 and A-43 and the 
Wc *3 A-231 and deletion consmicts and are indiCBted by the Shaded" clock. T, 'TATA'; C, "CCAAT"; 1, 'GATA'; L, 4 CCTTaTCAT' ; 
G.'ACGT'i lOnucjhotif, 'CCANNNNATC*. 



Since all members of the tomato Lhc gene fam- 
ily express the typical circadian mRNA accumula- 
tion pattern a computer-based search was initialed to 
screen for a conserved ciy-regulatory clement in the 
5'-upstream Sequences. This analysis was performed 
with long DNA sequences (e,g, 400 nucleotides) but 
railed to identity a possible candidate which may be 
responsible for the mRNA oscillations [25]. Therefore 
the primary goal of the presented experiments was 
to explore the 5'-upstream sequences of the tomato 
Lhc genes by promoter deletion analysis in transgenic 
tobacco plants. 

Promoter deletion analysis 

Tobacco leaf discs were transformed via Agrohac- 
terium tu7nefaciens~med\Ale$ gene transfer with pro- 
moter deletion constructs of four tomato Lhc gcncs r 



Lhc bl*l (cab 1A; accession numbers M14445, 
M30616; promoter. X60922), Lhc bl*2 (cab IB, 
M14443; promoter: XG0923), Lhc a3 (cab 8; 
XI 5258). Lhc a4 (cnh 1 1, X57706). About fifty pri- 
mary iransformants of each deletion construct were 
regenerated and tested for Lhc mRNA expression lev- 
els. Individual plant lines with high expression levels 
were chosen for further analysis. 

To find out which of the deleted promoters medi- 
ate the circadian oscillations of tomato Lhc mRNA, 
the tobacco transfomiants were grown in LD and DD 
conditions. Leaves were harvested at appropriate time 
points and steady-state mRNA levels were determined 
by northern blot (Figure 4 for a description of meth- 
ods, sec [24]) or primer extension (Figures I A, 2, 3A) 
analysis and quantitated by primer extension analysis 
(Figure IB and 3B, for a description of methods, see 
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Table /. Sequence motif presence* in S'-upsituun regions of Lhc gene a. 



Lycaptrsicon csculmtum 


(-701 to -159) 


Lhcbn 


-254 


CAAACATATC 




(-152 to -1) 


Lhcbl*2 


-B7 


CAATGAGATC* 




<-27fl»-23l) 


Lhca3 


-237 


CAACJAGTATC 






UK** 


-116 


CAACTCAATC 








-92 


CAAAAAATC 


Triticum aesrivum 


(-211 lc»- 90) 


U\cb\ 


-17$ 


CAAGAGTATC 








-140 


Caatggcatc: 


Arabidopjis Oiallana 


(-111 tn-74) 


Lhcb) w \ 


-103 


caaaaatc 








-87 


CAATGAAI VA* 




(-SOOlo-l) 




-371 


CAATGG AA'l C 








-253 


ca aata aCitatc 








-174 


CAATOAAAAAATC 








-106 


caaaatc 



"Sequence overlaps with ihe 'CAAT" hox. 



[18, 25]. The mRNA levels of the tomato Lhc bl A l 
deletions A-1091 (Nos. 4, 1 1, 38) and A-701 (Nos. 4, 
9, 26) clearly oscillate in LD with a period of ca. 24 h. 
Only vety little or no tomato Lhc mRNA is detectable 
at 06:00, maximum mRNA levels were reached at 
10:00 and levels decrease thereafter (Figure 1 A and 
B). The phase of the circadian rhythms arc nut altered 
in different deletion constructs. In constant darkness 
the amplitudes are significantly reduced but transcripts 
increase after the night trough at the appropriate time 
may indicate that an endogenous oscillator influences 
the expression of the tomato Lhc gene. 

In contrast to the long promoter constructs it is very 
unlikely that the tobacco lines with the short promoter 
regions of UrcblM A-159 (No$. 2. 42, 47) and A-43 
(Nos. 24, 28, 33, 34) exhibit circadian mRNA ac- 
cumulation patterns (representative data in Figure 1). 
The mRNA levels fluctuate only twofold at low ex- 
pression levels, however a constant period length of 
ca, 24 h was not measured in LD and DD, particu- 
larly well documented after quantitation (Figure IB, 
right panels). These results clearly show that the short 
S'-upstrcara regions Ore sufficient for a basal mRNA 
accumulation and strongly suggests that sequences up- 
stream of — 159 are necessary for circadian mRNA 
accumulation of the tomato Lhc bl*l gene. Further- 
more, It is interesting to note that the expression level 
of the short constructs do not decrease in DD, while 
this is usually ohserved for complete Lhc genes which 
arc in their native 3' and 5' nucleotide surrounding. 

With the Lhc a3 deletion constructs similar results 
were obtained as with Lhc blM (Figure 2). Deletion 
A-322 (No, 6) and A-27K (No. 14) exhibited circa- 



dian Lhc mRNA accumulation while the mRNA of 
the deletion constructs A-23 1 (Nos. 24, 36) and A- 
148 (Nos. 16, 25) reached almost constant levels in 
LD and DD and no oscillations with a denned period 
length could be observed. Based on these data it is 
likely that a region of 47 nucleotides (-278 to -231) 
is necessary for circadian Lhc a3 mRNA oscillations. 
The low expression levels of the Lhc a3 gene in trans- 
genic tobacco prevented a quantitation with the primer 
extension analysis. 

Investigation of the Lhc h\*2 (Figure 3) and Lhc 
a4 (Figure 4) genes in tobacco revealed circadian 
Lhc mRNA accumulations for constructs that con- 
tain long as well as short 5'-upstream regions (Lhc 
bl*2:A-793,Nos. 11, 22; A-323,Nos. 13, 17; A-152, 
Nos. 1, 4, 21, 24, Lhc a4: A-251, No. 60, A-119, 
No. 37, 74). The mRNA constructs of the Lhc bl*2 
gene could be quaniitatcd with the primer extension 
analysis and representative results are depicted in Fig- 
ure 3B» A circadian oscillatory pattern is obvious for 
each deletion construct. It should be noted that the 
short 5'-upstream regions of Lhc bl*2 and Lhc a4 are 
apparently sufficient to confer circadian rhythm! city. 
This is a surprise since in contrast such short promoter 
regions of the Lhc bl*l and Lhc a3 genes exhibit an 
almost constant expression pattern (Figures 1 and 2). It 
therefore can be concluded that afferent 5' -upstream 
regions (relaiive to the transcription start sites) of the 
tomato Lhc genes are necessary for conferring circa- 
dian rhythmicity. The results of all deletion constructs 
are summarized in an overview in Figure 5. 
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The promoter deletion analysis indicated regions 
which are necessary for the transmission of the cir- 
cadian clock in transgenic plants. If a conserved 
'dock' regulatory element exists it should be local- 
ized within these respective regions. Therefore the 
following 5'-upstrcam sequences were aligned: Lhc 
blM, -701 to -159; Lhc bl*2, -152 to — I; Lhc 
a3: -278 to -231; Lhc a4, -119 tn -1. A novel 
sequence motif appear* to be conserved in all four 
tomato Lhc 5'-upstream regions (Tabic 1). The se- 
quence is 'CAANNNNATC, it is composed of three 
conserved nucleotides, four variable nucleotides fol- 
lowed by three conserved nucleotides. Identical or 
very similar motifs arc present in short 5' promoter re- 
gions of circadian controlled Lhc genes of Anibidvpsis 
thatiana {Lhc bl* J (cab 2), -1 H to -74 [2, 7J; Lhc 
bl*2 (cab 3), -371 [23] and Triticum aestivum (Lhc 
bl (cab 1), -211 to -90 [10]). Consistent with the 
hypothesis that this motif plays a role in transmitting 
circadian rhythmicity is its absence in the promoter 
of the non-eircadian expressed Lhc b gene of Pinus 
contorta (—1000 to -hi [5]). Also ioutresting in this 
context are the investigations of Wang ei ai [30]. Fol- 
lowing the phytochromc- regulated light signal trans- 
duction pathway of the A. thatiana Lhc hi "3 gene a 
specific protein, CCA I, a myb-related transcription 
factor, was identified. Interestingly, the binding site 
of this transcription factor overlaps with the motif 
which we found to be conserved in the 5'-upstream 
regions that are necessary for circadian expression of 
the four tomato Lhc genes. These findings support the 
emerging idea that light perception and clock func- 
tion arc closely related, a hypothesis recently staled 
by iCay [17]. His conclusion was based on the pres- 
ence of the PAS domains in clock-related proteins such 
as PER (period, Dn/sophila) and WC (white collar, 
Ncurospora) as well as in photosensory proteins such 
as photochrome (PHY, plants) and the bacterial blue- 
light receptor (PYP, photoactive yellow protein [20]). 
Isolation and characterization of the matrt^-rttgulatory 
factors) binding to the novel 4 CAANNNNATC k mo- 
tif are on the way and will bring further knowledge 
regarding the hypothesis that the signal transduction 
pathways of the light and the circadian clock cross-talk 
or converge. 
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